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In the tollbooth problem, we are given a tree T = (V, E) with n edges, and a 
set of m customers, each of whom is interested in purchasing a path on the tree. 
Each customer has a fixed budget, and the objective is to price the edges of T 
such that the total revenue made by selling the paths to the customers that can 



^ ■ afford them is maximized. An important special case of this problem, known as 

the highway problem, is when T is restricted to be a line. 

For the tollbooth problem, we present a randomized 0(logn)-approxirnation, 
improving on the current best 0(logm)-approximation. We also study a special 
case of the tollbooth problem, when all the paths that customers are interested 



> 
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I in purchasing go towards a fixed root of T. In this case, we present an algorithm 

that returns a (1 — ^-approximation, for any e > 0, and runs in quasi-polynomial 
time. On the other hand, we rule out the existence of an FPTAS by showing 
that even for the line case, the problem is strongly NP-hard. Finally, we show 
<^ > that in the coupon model, when we allow some items to be priced below zero to 

C^N . improve the overall profit, the problem becomes even APX-hard. 

> '■ 

Consider the problem of pricing the bandwidth along the links of a network such that 
the revenue obtained from customers interested in buying bandwidth along certain 
paths in the network is maximized. Suppose that each customer declares a set of 
paths she is interested in buying, and a maximum amount she is is willing to pay for 
each path. The network service provider's objective is to assign single prices to the 
links such that the total revenue from customers who can afford to purchase their 
paths is maximized. Recently, numerous papers have appeared on the computational 
complexity of such pricing problems [H El El [H HUl UHl HU [HI HH [8] - 

A special case of this problem, where each customer is interested in purchasing 
only a single path (single-minded), and where there is no upper bound on the number 
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of customers purchasing each link (unlimited supply) was studied by Guruswami et 
al. [T5], under the name of tollbooth problem. The authors of [15] showed that the 
problem is already APX-hard when the network is restricted to be a tree, and also 
presented a polynomial time algorithm for the case when all paths start at a certain 
root of the tree. In [TS], the authors also studied the highway problem, a further 
restriction where the tree is a path, and gave polynomial time algorithms when either 
the budgets are bounded and integral, or all paths have a bounded length. 

In this paper, we continue the study of these problems. For the tollbooth prob- 
lem, the best known approximation factor was 0(logn + logm), where n and m are 
respectively the number of edges of the tree and the number of customers. This result 
applies in fact for general sets [15] , and not necessarily paths of a network, and even in 
the non single-minded case [1] . Very recently, and more generally, Cheung and Swamy 
[8] gave an algorithm that, given any LP-based a- approximation algorithm for max- 
imizing the social welfare under limited supply, returns a solution with profit within 
a factor of a\ogu max of the maximum, where u max is the maximum supply of an 
item. In particular, this gives an 0(logm)-approximation for the tollbooth problem 
on trees. In this paper, we give an 0(logn)-approximation which is an improvement 
over the O(logm) since n < 3m can be always assumed. We also show that if all the 
paths are going towards a certain root, then a (1 — e)-approximation can be obtained 
in quasi-polynomial time. This result extends a recently developed quasi-PTAS [lOj 
for the highway problem, and uses essentially the same technique. However, there is a 
number of technical issues that have to be resolved for this technique to work on trees; 
most notably is the use of the Separator Theorem for trees, and the modification of 
the price-guessing strategy to allow only for one-sided guesses. 

The existence of a quasi-PTAS for the highway problem indicates that a PTAS or 
even an FPTAS is still a possibility, since the problem was only known to be weakly 
NP-hard [6j. In the last section of this paper, we show that the highway problem is 
indeed strongly NP-hard and hence admits no FPTAS unless P=NP. 

Balcan et al. [3] considered a model in which some items can be priced below 
zero (in the form of a discount) so that the overall profit is maximized. They gave a 
4-approximation for the uniform budgets case, and a quasi-PTAS for a special case 
in which there is an optimal pricing that has only a bounded number of negatively 
priced items. Here we show that the existence of a quasi-PTAS in the general case is 
highly unlikely, by showing that the problem is APX-hard. 

In the next section, we give a formal definition of the problem. In Section [3] we 
give a O(logn) approximation for trees and in Section 2] we give a quasi-PTAS for 
the case of uncrossing paths. We conclude in Sectional 

2 The tollbooth problem on trees 
2.1 Notation 

Let T = (V, E) be a tree. We assume that we are given a (multi)set of paths Z = 
{I±, . . . ,I m }, defined on the set of edges E, where Ij = [sj,tj] C E is the path 
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connecting Sj and tj in T. For Ij G X, we denote by B(Ij) G M + the budget of path 
i.e., the maximum amount of money customer j is willing to pay for purchasing 
path Ij. In the tollbooth problem, denoted henceforth by Tb, the objective is to assign 
a price p(e) G M+ for each edge e G E, and to find a subset J C X, so as to maximize 

subject to the budget constraints 

p(I)<B{I), for all Je J, (2) 

where, for I G X, p(J) = See/^( e )- 

For a node u> G V, let I[w] C I be the set of paths that pass through w. In 
section [U we will assume that the tree is rooted at some node r G V . The depth of T, 
denoted d(T), is the length of the longest path from the root r to a leaf. For a node 
w G V, we denote by T(w), the subtree of T rooted at w (excluding the path from 
the parent of w to r), and for a subtree T' of T we denote by V(T'), E(T') and X(T') 
the vertex set, edge set, and set of intervals contained completely in T', respectively. 

2.2 Preliminaries 

In the following sections, we denote by p* : E \— > M + an optimal set of prices, and 
by Opt C X the set of intervals purchased in this optimum solution. For a subset of 
intervals X' C X, and a price function p : E i— > M + , we denote by p(X') = Yliei'P(I) 
the total price of intervals in X'. 

It easy to see that n < 3m may be assumed without loss of generality. Indeed, 
if we root the tree at some vertex r, then for every vertex v G V, we may assume 
that there is either an interval I G X beginning at v or an interval / G X that passes 
through two different children of v; otherwise, every interval through v must contain 
its parent u (unless v = r in which case all edges incident to r can be contracted), 
and hence we can contract the edge e = {u, v} and increase by p*(e) the prices of 
each the edges {v, v'} for each child v' of v. 

Let e > be a given constant. 

Proposition 1 ( |10| ) Let p* be an optimal solution for a given instance of Tb, and 
e > be a given constant. Then there exists a price function p : E i— ► ]R + for which 

(i) p(e) G {0, 1, • • • , P}, for every e G E, where P = nm/e, 

B( I) 

(ii) p(I) < -yi^ f or every I G Opt, and 
(Hi) p(Opt) > (1 - 2e)p* (Opt). 

We shall call the set of prices p satisfying the conditions of Proposition [H e-optimal 
prices. 

We will make use of the following well-known separator result for trees. 
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Proposition 2 Let T = (V, E) be a tree. Then there exists a node v ( called sepa- 
rator node) with the following property: Let s±, . . . ,s r be the sizes of the components 
obtained by deleting v from T, then there is a subset S C [r] such that 



Such a separator can be found in linear time. 

This gives a recursive partitioning of T in the following standard way: Let vq be 
a separator vertex in T and T\, . . . , T r be the components of T — vq. Recursively, find 
separator vertices v\, . . . v r in Ti, . . . , T r . We say that node vq has level(uo) = 1, nodes 
vi, . . . ,v r have level 2, and in general if node v is a separator vertex in the subtree T' 
obtained by deleting one-higher level separator vertex v' then level(v) = level(v') + 1. 
By ([3]), the maximum number of levels k in this decomposition is at most log 3 / 2 n. 
We shall denote by M(T) the set of separator nodes used in the full decomposition 



3 An O(logn) approximation for the tollbooth problem 
on trees 

In this section, we prove the following theorem. 

Theorem 1 There is a deterministic O (log n)- approximation algorithm for Tb. 

The proof goes along the same lines used in [2] to obtain an 0(logn)-approximation 
for the highway problem. The algorithm consists of 3 main steps: Partitioning, 
"randomized cut" , and then dynamic programming. We can then derandomize it to 
obtain a deterministic algorithm. 

We say that the given set of paths Z is rooted, if all the paths in T start at some 
node r, called the root of T. We will also make use of the following theorem. 

Theorem 2 ( |15| ) The tollbooth problem on rooted paths can be solved in polynomial 
time using dynamic programming. 

For i = 1, . . . , k, let 

I(i) = {I € I : i is the smallest level of a separator vertex v € W(T) contained in /}. 

Then I = Uj g u.]I(i) and In J = for all I, J G that contain distinct separators at 
level i. Let (Opt,^*) be an optimal solution. Then, p*(Opt) = Yli=i P*(OPTn J(i)). 
Thus if we solve k independent problems on each of the sets T(i), i = 1, . . . , k, and take 
the solution with maximum revenue, we get a solution of value at least p*(OPT)/k. 
Thus it remains to show the following result. 




(3) 



of T. 



4 



Theorem 3 Let v be a node o/T, and suppose that all the paths in X go through v. 
Then a solution {J,p) of expected value p{J) > p*(Opt)/8 can be found in polynomial 
time. 

Proof. Let v±,...,v r be the nodes adjacent to v. Note that each path I El can be 
divided into two sub-paths starting at v; we denote them by I\ and 12- We use the 
following procedure. 

1. Let X C {v\, . . . ,v r } be a subset obtained by picking each v% randomly and 
independently with probability 1/2. 

2. Let I' = {Ij 6l: j 6 {1, 2}, Ij contains exactly one vertex of X}. 

3. Use dynamic programming (cf. Theorem [2]) to get an optimal solution (J~,p) 
on the instance defined by X' and the tree T" with root v and sub-trees rooted 
at the children in X. 

4. Extend p with zeros on all the other arcs not in T', and return (J~,p). 

Let (Opt,p*) be an optimal solution. We now argue that the solution returned by 
this algorithm has expected revenue of p*(Opt)/8. Clearly, for every J el, either 
P*(h) > P*0O/2 or p*{l2) > P*(-0/2; let us call this more profitable part by I*. 
Then ~}2 Ie0pT p*{I*) > p*(Opt)/2. Let Opt' = {/ € Opt : 1* contains exactly 
one vertex of X}. Note that with probability at least 1/4 each / € Opt has I* 
intersecting the random set X in exactly one vertex. In particular, 

e[p*(opt')] = np*(Q] >\H ^ ^*(° pt )- 

jgOpt /gOpt 

Since what our procedure returns is at least as profitable as this quantity, the theorem 
follows. □ 

The randomized algorithm above can be derandomized using the method of pair- 
wise independence [171 [TBI E] ■ 

4 Uncrossing paths 

Here we assume that the tree is rooted at some node r € V, and that paths in I have 
the following uncrossing property: If I = [s,t] £ X then t lies on the path [s,r]. This 
property implies that once paths in I meet they cannot diverge. 

In the course of the solution, we shall consider the following generalized version 
of the problem: Given intervals as above, and also a function h : 1 1— » M + , find JCI 
and a pricing p : E \—> R +) satisfying ([2]) and maximizing J^/gj h(I,p). 

Given a price function p : E i— > M + and a node io £ V, the accumulative price 
at any node it on i/ie pai/i [io,r] with respect to it; is defined as p([w,u\). Obviously, 
this monotonically increases as u moves towards the root. In this section we prove 
the following theorem. 
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Theorem 4 There is a quasi-polynomial time approximation scheme for the tollbooth 
problem with uncrossing paths. 

In the following, we fix K = [log (nP) j log (1 + e)~|. 

Definition 5 (e-Relative pricings) Let w £V be a given node of T, and < k < K 
and < k! < 2 log 3 / 2 n> be given integers. We call any selection ofk nodes Ui,...,Uk € 
V, k indices —oo < i\ < ■ ■ ■ < ik < K, and k' values pi, ■ ■ ■ ,Pk' £ {0, 1, . . . , nP}, 
such that w,ui,U2, ■ ■ ■ , itfe,r lie on the path [w,r] in that order, an e-relative pricing 
w.r.t. w, and denote it by (w, k, k' , u\, . . . , u^, i±, . . . , ik,Pi, ■ ■ ■ ,Pk')- 

The total number of possible e-relative pricings with respect to a given w € V is at 
most 

L = (d(T)K) K (nP + l) 2 log s/2 n , (4) 
which is 77i polylog ( m ) for every fixed e > 0. 

Definition 6 (Consistent pricings) Let R = (w,k,k' ,Ui, . . . ,ut-,ii, ■ ■ ■ ■ ■ ■ ,Pk') 

be an e-relative pricing w.r.t. node w G V, £ = {s\, . . . , sy} be the set of separators 
from JV(T) on the path from (w, r], and p : E i— » M + be a pricing of E. We say that 
R is e -consistent with p and C if 

(CI) for j = 1, . . . , k — 1, (1 + e) 1 ? < p([w, u]) < (1 + e)^ +1 if u lies in the interval 
[uj ,Uj + i) ( excluding Uj+i), 

(C2) for j = l,..., k', p{[w, 8 j\) = pj. 

Lemma 1 Let p : E i— > R + be an e- optimal pricing for a given instance of Tb, 
w € V be an arbitrary node, and C = {s\, . . . , s^} be the set of separators in JV(T) 
on the path from (w,r\. Then there exists an e-relative pricing R w.r.t. w, that is 
e-consistent with p and C. 

With every e-relative pricing R, we can associate a system of linear inequalities, 
denoted by S(R), on a set of E variables {p(e) : e € E}, consisting of the constraints 
(CI) and (C2), together with the non-negativity constraints p(e) > 0. The feasible 
set for this system gives the set of all possible pricings with which R is e-consistent. 
For two systems of inequalities S±,S2, we denote by Si A S2 the system obtained by 
combining their inequalities. 

Let R = (w, k, k! , u\, . . . , Ufc, i\, . . . , i[, . . . , i' k ,) be an e-relative pricing w.r.t. a 
node w £ V. Given an interval / € T[w], we associate a value v(L,R) to /, defined 
with respect to R as follows: Let j(L) be the largest index such that contained 
in /. Then, define v(L,R) = (1 + e) J W. For a subset of intervals T 1 C Z, we define, 
as usual, v(I',R) = V (L , R). It follows that for any e-relative pricing R w.r.t. 
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a node w G V, any p : E ]R + with which i? is consistent, and any / = [s, t] G 
we have 

<p(Kt]) < (1 + £)«(/, 12). (5) 

Decomposition into two subproblems. Let w € Af(T) be a separator node. 
Then T can be decomposed into two subtrees Tl = (Vl,El) and Tr = (Vr,Er), 
such that the root r G Vr and to is the root of T^. We define two Tb instances 
(T L ,X L ) and (T R ,1 R ) where: 

J = {[ s ,t] G : s e Vl and t G Vr}, 

T l = {M]el : s,teV L }U{[s,w] : [s,t]£l }, 

1 R = {[s,t] £1 : s,t€V R }. 

In other words, the intervals passing through w, crossing from to Tr are truncated 
in Tx, while all other intervals remain the sam^]. Note that from the choice of w, we 
have max{|V(Ti)|, |V(T fl )|} < Tp + 1, and both instances (T L ,X L ) and (T R ,X R ) are 
of the uncrossing type, with roots w and r, respectively. 

The algorithm is shown in Figure [TJ It is initially called with an empty S, and 
with h(I) = for all J £ I, The procedure iterates over all e-relative pricings R, 
consistent with S, w.r.t. the middle edge e* , then recurses on the subsets of intervals 
to the left and right of e*. Intervals crossing from to Tr will be truncated and 
their values will be charged to T^; hence the corresponding budgets are reduced, and 
the corresponding /i-values are increased. 

Solving the base case. At the lowest level of recursion (either line 1 or 4), we 
have to solve a linear program defined by the system S. Note that the system may 
contain constraints on variables outside the current set of edges E of the current tree 
T (resulting from previous nodes of the recursion tree). However, we can reduce this 
LP to one that involves only variables in E. Indeed, any constraint that involves a 
variable not in E, has the form L < p([w,u]) < U, where u G V(T), and w ^ V(T) 
is a separator node such that there is another separator node w' G V(T) on the path 
from w to u. Then when w' was considered in the recursion, a constraint of the form 
p([w,w')] = q, for some value q, was appended to S (recall (C2) in the definition 
of consistent pricings). Now, we can replace the first constraint by the equivalent 
constraint L — q < p([w',u]) < U — q, which only involves variables from E. This is 
exactly what procedure REDUCE(5, •) does in lines 2 and 6. 

When the procedure returns, we get a pricing p : E i— > M + and a set of intervals 
J C X which can be purchased under this pricing. 
Theorem [5] follows from the following two lemmas. 

Lemma 2 Algorithm Tb runs in quasi-polynomial time in m, for any fixed e > 0. 

Lemma 3 For any e > 0, Algorithm Tb returns a pricing p and a set of intervals J 
such that p(I) < B(I) for all I G J and p(J) > (1 - 3e)p*(OPT). 

1 throughout, we will make the implicit assumption that each interval has an "identity"; so, for 
instance, Tl nXo will be used to denote the set {I £ To : I = [s,t] and [s,w] £ Tl} 
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Algorithm Tb(T,J, r, B, h, S): 

Input: An uncrossing Tb instance (T = (V, E),T) with root r, 

budgets and values B,h :T <—> R+, and a feasible system of inequalities S 
Output: A pricing p : E R + and a subset JCI 

1. if \1\ = 0, then 

2. S' ^REDUCE(5, E) 

3. return (p, 0), where p is any feasible solution of S' 

4. if d(T) = 1, then 

5. for each edge e of T do 

6. S' ^REDUCE(S,{e}) 

7. P(e)^axgrnax{Y J i e x :p ,<B(i)(K I )+P') : p' satisfies S'} 

8. J(e)^{/eX:B(/)>p(e)} 

9. return ((p(c) : e £ £), U eSB .7(e)) 

10. let w be a separator node of T and Tl, Tr,^o,^l,^r be as defined above 

11. for every e-relative pricing R w.r.t. w for which S A S(R) is feasible do 

12. for each I G Xo do 

13. B(J) -(1 + £)«(/, iJ) 

14. ft(7) <- h(I) +v(I,R) 

15. (p 1) J 1 )^TB(T i ,J i ,ii;, J B,/i,5) 

16. faiJn) «- TB(T K ,I Jl ,r,fl,/i J 5AS(ie)) 

17. let p be the pricing defined by p(e) = j>i(e) if e € El and p(e) = pi(e) if e G En 

18. J <— J1UJ2 

19. record (p, J7") 

20. return the recorded solution with largest p{J) + h(J) value 



Figure 1: The procedure for computing e- approximate prices. 
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5 Hardness of the highway problem 



5.1 Strong NP-hardness in the standard model 

Recall that the highway problem is the special case of the tollbooth problem when the 
underlying graph is a path. In [15j . Guruswami, et al. considered the highway problem 
and gave a polynomial time algorithm when the maximum budget is bounded by a 
constant, and all the budgets are integral. Balcan and Blum [2] gave a constant factor 
approximation algorithm when all intervals have the same length. Breist and Krysta 
[6] showed that the problem is weakly NP-hard. In [13], Grigoriev et al. showed 
that a restricted version of the problem when the prices are required to satisfy a 
monotonicity condition remains weakly NP-hard. In this section, we show that the 
problem is strongly NP-hard by a reduction from MAX-2-SAT. 

Consider a MAX-2-SAT instance with n variables {x\,--- ,x n } and m clauses 
{Ci, • • • ,C m }. Let the variables be numbered 1, • • • ,n. We construct a gadget for 
each variable and each clause. We start by describing the gadgets in our construction. 

5.1.1 Variable Gadget 

The variable gadget for each variable consists of two copies of the following basic gadget 
and a consistency gadget. We first describe the basic gadget, and then describe the 
consistency gadget and the construction of a variable gadget. 

Basic Gadget: The basic gadget consists of 4 edges ei, ••• , e^, and 4 types of inter- 
vals A, B, C and D. There are 4 intervals each of type A and B, labeled a\, ■ ■ ■ , a^, 
and ,64 respectively. The intervals at = b% = [ei], i = l,---4. The intervals 

a\, ■ ■ ■ ,04 have budgets of 1, 2, 2, 1 respectively, and the intervals b±, ■ ■ ■ ,64 have bud- 
gets 2, 1, 1,2 respectively. There are 2 type C intervals, c\ and C2, with c\ = [ei,e2], 
and C2 = [63,64]. These intervals have a budget of 3. There are two intervals of 
type D, d\ = d2 = [e2, e.3] with d\ having a budget of 4, and efo, a budget of 2. The 
basic gadget is shown in Figure [21 We now show that there are exactly two price 
assignments for {ei, • • • , e^} that gives us optimum profit. 

Lemma 4 The maximum profit that can be obtained from a basic gadget is 18, and 
there are exactly two sets of prices that achieve this profit. 

We call the price assignment (1,2,2,1) to the edges e\,--- ,e± respectively, a 
TRUE assignment, and the price assignment (2,1,1,2) to the edges e\,--- re- 
spectively, a FALSE assignment. The variable gadget is constructed on 8n + 1 edges 
(e 4 n.) C4 n _i, • ■ ■ , e±, h, fi, ■ ■ ■ , fin), where n is the number of variables in the MAX-2- 
SAT instance. Each variable gadget consists of two copies of the basic gadget, along 
with a consistency gadget. The consistency gadget ensures that the two basic gadgets 
have the same price assignment, i.e., both set to TRUE, or both set to FALSE. More 
formally, let {x\, ■ ■ ■ ,x n ) be an order on the variables of the MAX-2-SAT instance. 
Then, the gadget for variable Xi, consists of two basic gadgets, B\ and Bf. B\ con- 
sists of intervals (customers) interested in the edges eu-z , • •• ,ea and Bf consists of 
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Figure 2: A basic gadget. The gadget consists of 4 edges, and 4 types of intervals 
A,B,C and D. The interval labels are shown below each interval, and the budgets 
are shown above each interval. 



intervals interested in the edges fus,--- ,fa. Finally, the intervals ensuring con- 
sistency of the gadget for variable x% spans from e^i-i,--- , f^s- The consistency 
gadget consists of a single interval that has a budget of mn 2 + 6(2i — 2) + 6. Finally, 
we add a new type of interval, called a type H interval that is interested only in the 
edge h, and has a budget of mn 2 . 

Figure [3] shows the arrangement of the variable gadgets. We now show that the 
consistency intervals do their job. i.e., if for a variable gadget, Bj and B 2 have 
different price assignments, we obtain a smaller profit than when they are the same. 



mn 2 + 12 + 6 



e 7 e 6 

X2 



e 5 e 4 



e3 e 2 ei 

Xi 



mn 

H 

h 



fx f-2 h fi h fe h h 

Xi x 2 



Figure 3: The variable gadget. 



Lemma 5 The maximum profit of 2mn 2 + 6(2i — 2) + 6 + 36 from a variable gadget 
and the interval h is achieved only when both the basic gadgets corresponding to a 
variable are consistent, and the type H interval purchases edge h at a price of mn 2 . 

We will create several copies of the basic gadgets, the consistency gadgets for each 
variable as well as several copies of the H interval to ensure that in an optimum price 
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Clause 


Interval 


Budget 


(xi V xj) 


[e4i-3 5 /4j-3] 


mn 2 + 6(i + j - 2) + 3 


(x~ V Xj) 


[ e 4i-l> /43-3J 


77177 2 + 6(i + j — 2) + 6 


(xi V xj) 


[e4i-3 5 /43-1] 


mn 2 + 6(i + j - 2) + 6 


(xJVxj) 


[e4i-l, /4j-l] 


mn 2 + 6(i + j - 2) + 9 



Figure 4: This table shows the lengths and budgets of the intervals making up a 
clause gadget for the four different kinds of clauses. 

assignment, the basic gadgets are consistent, and the reduction goes through. But 
before we do this, we describe the clause gadgets. 

5.1.2 Clause Gadgets 

The clause gadget for a clause of variables Xi and Xj runs between the basic gadget 
B\ and B 2 . There are four types of clause gadgets corresponding to the four types of 
clauses. Each clause gadget consists of one interval. These intervals have the property 
that we obtain a certain revenue from the clause interval if and only if the clause is 
satisfied; otherwise we obtain nothing. The clause gadgets for the four types of clauses 
are shown in Table U] and in Figures El El El and [9] in the Appendix. 

We say that a pricing is consistent if for every variable, the price assignment to 
the two basic gadgets of the variable gadget are both TRUE or both FALSE, and the 
consistency intervals spend their entire budgets. 

Lemma 6 Consider a clause C consisting of variables Xi and Xj and a consistent 
price assignment to the edges. Then, the intervals corresponding to C will be able to 
purchase their desired edges if and only if the corresponding truth assignment to the 
variables satisfies the clause C. 

5.1.3 NP-hardness 

We now describe the final reduction. As mentioned earlier, we have to create copies of 
the variable gadget, consistency gadget and the H interval for the proof to go through. 
We make T copies of each basic gadget, of each consistency gadget, and of the H 
interval, where any value of T, larger than m 2 n 3 will suffice for the proof. Observe 
that for a variable gadget again, the profit maximizing prices achieve consistency of 
the variable gadget, and making T copies of the H intervals ensures that the price of 
the edge h is set to mn 2 . 

Theorem 7 The highway problem is strongly NP-hard. 
5.2 APX-hardness in the discount model 

Theorem 8 The highway problem with negative prices is APX-hard, even restricted 
to instances in which one edge is shared by all customers. 
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Ix B=2 2x B=l and lx B=3 lOx B=l and lOx B=2 

Figure 5: Reduction: Each edges e is partitioned in edges e\ (fat) and &2 (dotted). 
Which is which is arbitrary. The notation lx B=2 means that there is one customer 
with budget 2 on this type of edge. 

Proof. We will show that the problem is equivalent to a pricing problem on 
bipartite graphs and prove that the latter problem is APX-hard. Assume we are 
given an instance of the highway problem in which edge e is contained in each of 
the intervals. We split edge e by adding a node vq on e. This has no effect on the 
problem. We construct a bipartite graph H with one set consisting of the points left 
of vo and the other set consisting of the points right of vq. Now, an interval containing 
e becomes an edge in the bipartite graph. The items are the vertices and a customer 
is interested in the two items on the vertices of its edge. Given a pricing p for the 
highway instance, we define the pricing q p of H by letting the price of a vertex v in 
H be the cumulative price p([vq,v]). Conversely, for any pricing q of the vertices of 
H there is a corresponding pricing p of the highway problem such that q = q p . 

We prove that the pricing problem on bipartite graphs is APX-hard by a reduction 
from maxcut on 3-regular graphs. Given a 3-regular graph G = (V,E), we make it 
bipartite by placing an extra vertex v e on every edge e, dividing it in two new edges e% 
and 62- For e\ there is one customer with budget 2 and for e2 there are two customers 
with budget 1 and one customer with budget 3. We define one extra vertex vq and 
define for each v £ V an edge a v = (v,vo). For each such edge a v there are ten 
customers with budget 1 and ten customers with budget 2. The bipartite graph 
partitions into V and the new vertices V = {v e \ e £ E} U {vq}. (To enhance reading 
we write prices and budgets in decimal and amounts of customers in words.) 

Consider any pricing p of the bipartite graph. We may assume that p(vo) = 
since subtracting p(vq) from all vertices in V' and adding p{vq) to all vertices in V 
does not change the profit. We will have to take into account though that prices may 
be negative. 

Next, we argue that in any optimal solution the price p(v) of any vertex v 6 V 
is either 1 or 2. Denote by p(e) the profit we get from the customer on e\ plus the 
three customers on e2- It is easy to see that 3 < p{e) < 5 for any edge e. Suppose 
p{v) = a ^ {1, 2}. If q > 2 or a < then the profit on a v is 0. By changing the price 
to p{v) = 2 the profit becomes ten times 2 is 20. The maximum profit on any of the 
three adjacent edges e is 5. Hence, we gain 20 and loose at most 15. Now assume 
< a < 1. We raise the price of v to price p(v) = 1 and reduce the price on the 
vertices v e by 1 — a for each of the three adjacent edges e of v in G. The 20 customers 
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on a v add an extra 20(1 — a) to the profit. No other customer sees an increase of its 
bundle price and at most 9 customers will see a reduction of the bundle price. Hence, 
we loose at most 9(1 — a) on them. Now assume 1 < a < 2. We raise the price to 
p(v) = 2 and reduce the price on the vertices v e of adjacent edges e by 2 — a. The 
argument is the same: We gain 10(2 — a) and loose at most 9(2 — a) since at most 9 
customers will see their bundle price drop. 

We showed that there is an optimal pricing in which p(vq) = and p{v) € {1, 2} 
for all v € V. Next we prove that there is a cut of size k in G if and only if the 
maximum profit is 20|V| + 4:\E\ + k. Given a cut of size k we price the vertices on 
one side 1 and on the other side 2. Now consider an edge e = (y,w) € E with 
p(v) = p{w) = 1. We can get a profit p(e) = 4 by setting p(v e ) = or p(v e ) = 1. 
This is also the maximum profit possible. Similarly, if p(v) = p{w) = 2 then we 
can get the maximum profit p(e) = 4 by setting p(v e ) = 0. Finally, if p(v) = 1 and 
p{w) = 2 then we can get the maximum possible profit p(e) = 5 by setting p(v e ) = 
or p(v e ) = 1, depending on how we chose e± and &i. From the customers on edges 
adjacent to vo we get the maximum profit of 20 per edge. The total profit is exactly 
20|y| + 5/c + 4(|i?| — k) = 2Qn + A\E\ + k and this is maximum possible if the maximum 
cut is k. The reduction is gap-preserving since 3|V| = 2\E\ and \E\ <2k. □ 

In the reduction we showed that there always is an optimal pricing of the bipartite 
graph with only non-negative prices. Hence, the bipartite graph pricing problem 
remains APX-hard if we restrict to non- negative prices. 

Corollary 1 The graph pricing problem is APX-hard on bipartite graphs and all 
budgets in {1, 2, 3}. This holds for the non-negative version as well as for the version 
with negative prices allowed. 

Guruswami et al. [15] show that the graph-pricing problem is APX-hard even if all 
budgets are 1. Note that the bipartite case is trivially solved in that case by setting 
a price of 1 to all items on one side. 

6 Conclusion 

In this paper, we presented an 0(logn)-approximation algorithm for the tollbooth 
problem on trees, which is better than the upper bound currently known for the 
general problem. Improving this bound is an interesting open problem. One plausible 
direction towards this is to use as a subroutine, the quasi-polynomial time algorithm 
for the case of uncrossing paths. Such techniques have been used before, for example 
for the multicut problem on trees [12] . However, it is unclear how a general instance of 
the Tb problem can be decomposed into a set of problems of the uncrossing type. For 
the highway problem, the strong NP-hardness presented in this paper shows that the 
problem is almost closed, modulo improving the running time from quasi-polynomial 
to polynomial. 



13 



Acknowledgements: We would like to thank Naveen Garg for suggesting to use 
the separator theorem in the proof of Theorem [TJ and Chaitanya Swamy for helpful 
remarks. 



References 

[1] G. Aggarwal and J. D. Hartline, Knapsack auctions, SODA '06: Proceedings 
of the seventeenth annual ACM-SIAM symposium on Discrete algorithm (New 
York, NY, USA), ACM Press, 2006, pp. 1083-1092. 

[2] M. F. Balcan and A. Blum, Approximation algorithms and online mechanisms 
for item pricing, EC '06: Proceedings of the 7th ACM conference on Electronic 
commerce (New York, NY, USA), ACM Press, 2006, pp. 29-35. 

[3] M.-F. Balcan, A. Blum, H. Chan, and M. Hajiaghayi, A theory of loss-leaders: 
Making money by pricing below cost, WINE, 2007, pp. 293-299. 

[4] M. F. Balcan, A. Blum, and Y. Mansour, Item pricing for revenue maximization, 
EC '08: Proceedings of the 9th ACM conference on Electronic commerce, to 
appear (New York, NY, USA), ACM Press, 2008. 

[5] M.F. Balcan and A. Blum, Approximation algorithms and online mechanisms for 
item pricing, Theory of Computing 3 (2007), 179-195. 

[6] P. Briest and P. Krysta, Single-minded unlimited supply pricing on sparse in- 
stances, SODA '06: Proceedings of the seventeenth annual ACM-SIAM sympo- 
sium on Discrete algorithm (New York, NY, USA), ACM Press, 2006, pp. 1093- 
1102. 

[7] , Buying cheap is expensive: Hardness of non-parametric multi-product 

pricing, Proc. 17th Annual ACM-SIAM Symposium on Discrete Algorithms, 
ACM-SIAM, 2007. 

[8] M. Cheung and C. Swamy, Approximation algorithms for single-minded envy-free 
profit-maximization problems with limited supply, to appear, FOCS, 2008. 

[9] E. D. Demaine, M. T. Hajiaghayi, U. Feige, and M. R. Salavatipour, Combi- 
nation can be hard: approximability of the unique coverage problem, SODA '06: 
Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algo- 
rithm (New York, NY, USA), ACM Press, 2006, pp. 162-171. 

[10] K.M. Elbassioni, RA. Sitters, and Y. Zhang, A quasi-PTAS for profit- 
maximizing pricing on line graphs, ESA (L. Arge, M. Hoffmann, and E. Welzl, 
eds.), Lecture Notes in Computer Science, vol. 4698, Springer, 2007, pp. 451-462. 

[11] P. W. Glynn, B. Van Roy, and P. Rusmevichientong, A nonparametric approach 
to multi-product pricing, Operations Research 54 (2006), no. 1, 82-98. 



14 



[12] D. Golovin, V. Nagarajan, and M. Singh, Approximating the k-multicut problem, 
SODA, 2006, pp. 621-630. 



[13] A. Grigoriev, J. van Loon, R. Sitters, and M. Uetz, How to sell a graph: Guide- 
lines for graph retailers., WG, 2006, pp. 125-136. 

[14] A. Grigoriev, J. van Loon, M. Sviridenko, M. Uetz, and T. Vredeveld, Bundle 
pricing with comparable items, ESA, 2007, pp. 475-486. 

[15] V. Guruswami, J. D. Hartline, A. R. Karlin, D. Kempe, C. Kenyon, and F. Mc- 
Sherry, On profit-maximizing envy-free pricing, SODA '05: Proceedings of the 
sixteenth annual ACM-SIAM symposium on Discrete algorithms (Philadelphia, 
PA, USA), Society for Industrial and Applied Mathematics, 2005, pp. 1164-1173. 

[16] J. D. Hartline and V. Koltun, Near-optimal pricing in near-linear time, Algo- 
rithms and Data Structures - WADS 2005 (F. K. H. A. Dehne, A. Lopez-Ortiz, 
and J.-R. Sack, eds.), Lecture Notes in Computer Sciences, vol. 3608, Springer, 
2005, pp. 422-431. 

[17] M.Luby and A. Wigderson, Pairwise independence and derandomization, Foun- 
dations and Trends in Theoretical Computer Science 1 (2005), no. 4, 237-301. 

[18] R. Motwani and P. Raghavan, Randomized algorithms, Cambridge University 
Press, 1995. 



15 



Appendix A: Proofs 

Proof of Lemma[TJ Let R = (w, k, k' ,u±, . . . , Uk,h, . . . ,ik,Pi, ■ ■ ■ ,Pk') be denned as 
follows: write uq = w and i$ = — oo, and let ij and Uj, for j = 1, 2, . . ., be respectively 
the smallest non-negative index and the closest node to Uj-\ on the path [itj-_i,r] 
with (1 + e) 4j_1 < p([w,Uj]) < (1 + e)* 3 '; k will be the largest such index j. Finally, 
for j = 1, let pj = p([w,Sj]). Note that k < K since (1 + e) k < nP, and 

k' < 21og 3( / 2 n since the number of separators on the path from any node to the root 
is at most 21og 3 / 2 ^- □ 

Proof of Lemma [2j The number of possible e-relative pricing is at most L, given 
in ([3]). This gives the recurrence 

2ra 

T(n) < poly(n, m) + 2L ■ T( — ). 

3 

for the running time. Thus T(n) < L°( logn ) poly(m) and the lemma follows. □ 



Proof of Lemma [3j Let (pt,<Jt) be the solution returned by the algorithm when 
the input is (T,2t, i"t, -Bt> <St)- We show by induction on the depth of the 
recursion tree that, if there exists a pricing p' T satisfying 5t, and a set Jl^ such that 
p' T (I) < B T (I)/(1 + e) for all / € J T , then 

(i) Pt(I) < for all / € and px is feasible for St', and 

(ii) Pt(Jt) + h T {J T ) > + h T {J T ). 

The statement of the theorem follows from (i) and (ii) by taking, at the highest 
level where hi (I) = for all /, p' T = p (an e-optimal pricing) and J T = Opt. 
Base case. At a leaf of the recursion tree, we either have \X\ = in which case (i) 
and (ii) are trivially satisfied, or d(T) = 1 in which case (i) and the stronger version 
of (ii), pt(Jt) + MvTt) > Pt(^t) + ^t^t), are insured by the computation in line 
7. 

General recursion level. Let w,Tl,Tr,Xo,Xl,Xr be as defined in line 10 at the 
current level, and pi,P2,<Ji and Ji the returned pricings and sets at lines 15 and 
16. Let i?x be an e-relative pricing consistent with p' T . Then the restrictions p Ti 
and p' T of p' T on Ti and satisfy, respectively, St l and St r - Moreover, for 

(I) 

any I € J T PI (Xl \ Xq), we have p' Tl (I) < ^ ; for any / € J T n Ir, we have 
Pt r (I) — ~T+i~ > an< ^ ^ or an y ^ = t s '*] e <^t ^ we have p' Tl (I) = p' T ([s,w]) = 

p' T ([s,t}) - p' T ([w,t}) < p' T ({s,t}) - v(I,R T ) < - v(I,R T ) = where 

the first inequality follow from ([5]), and the last equation follows from line 13 of the 
procedure. Thus we can apply the induction hypothesis to the two subproblems, and 
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hence get that 



pi CO 
Mi) 

Pi(Ji) + h TL (Ji) 
P2(J2) + h TR (J 2 ) 

and both p\ and p 2 , and hence pt, satisfy <St- By ([6]) and (J7|), we have pt(I) < B^(I) 
for all I € JiU J2 \Io- By (jSJ) and line 13 of the procedure, we also have pi([s, w]) < 
B T (I) - (1 + e)v(I, R T ) for all I = [s, i]el n Ji. Since p 2 satisfies 5(i? T ) (c.f. line 
16), and hence is e-consistent with R^, we get by ([5]) that p 2 ([w, t]) < (1 + c)v(I, Rt) 
for all / = [s,i] G Xo. Combining this with the above inequality gives pr(I) = 
pi[s,w] + p 2 [w,t] < Bt(I) for all / = [s,t] G Iq D i7i, and hence proves (i). 

Now we prove (ii). We have the following: p' T (J^ nX^ \2o) = Pt(^t ^I-l \Zq), 

h TL {j^ ni L \i ) = h T (j^ni L \i ), p ' Tr {jJj. n i R ) = P ' T n i R ) , h TR (j^ n i R ) = 

h-T^r nXij), and 

PT L (^T n:z o) = 

/itJ^tHJo) = 

> 

where the last inequality follows by (JSJ. Summing all these together gives (ii) and 
concludes the proof of the lemma. □ 

Proof of Lemma |4j Consider the pair of intervals {a>i,bi} for each i = 1, • • • ,4. 
The maximum profit that can be obtained from such a pair is 2, which is obtained 
by setting either p(ei) = 1, or p{ei) = 2. Any other price clearly yields a smaller 
profit. Similarly if we consider only the intervals of type D, the maximum profit 
is obtained by setting p(e 2 ) + p{e%) = 2, or p(e 2 ) + p{e%) = 4. This gives us 4 
price vectors that give us maximum profit from all except the type C intervals, viz. 
(1, 1, 1, 1), (2, 2, 2, 2), (1, 2, 2, 1), (2, 1, 1, 2). In the first case, we only obtain a profit of 
4 from the type C intervals for a total profit of 16, while in the second case, we exceed 
the budget of both the type C intervals giving us a profit of only 12. Thus there are 
only two profit maximizing price assignments. □ □ 

Proof of Lemma[5j Consider the gadget for variable X{. If the gadget is consistent, 
we see that both the consistency gadget, and the type H interval spend their entire 



< B Tl {I) for all Ig Ji, 

< B Tr (I) for all I G J 2 , 



> 



> 



1 + e 

p' TR {j^r\i R ) 

l + e 



+ h TL {jJ c ni L ), 
+ h TR (J^ni R ), 



(6) 
(7) 

(8) 
(9) 



22 Pt([s,w]), 
i=[s,t]ej^rdo 

E MMD+ E v (i> r t), 

i=[s,t]eJ^mo i=[s,t] eJJ^rOo 

£ h T (I)+ £ P ' t(M) 



i=[s,t\ejLra 



l + e 
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budget, and we obtain a profit of 2mn 2 + 6(2* — 2) + 6 + 36. Suppose B\ is TRUE and 
Bf is FALSE. Then, we are forced to set the price of edge h to mn 2 — 1, otherwise the 
consistency gadget is unable to purchase it's edges and we lose at least mn 2 + 6(2? — 
2) + 6 from the total profit. However, by setting p(h) = mn 2 — 1, the maximum profit 
we obtain is at most (mn 2 — 1) + mn 2 + 6(2i — 2) + 36, which is smaller than the 
maximum profit by 1 unit. On the other hand, if B\ is FALSE, and Bf is TRUE, we 
lose 1 unit from the maximum profit since we cannot raise the price of edge h to more 
than mn 2 , and the consistency gadget is unable to spend it's entire budget. Hence, 
the maximum profit is obtained only when the variable gadget is consistent. □ 

Proof of Lemma [6j Consider a consistent price assignment, with the edge h having 
a price of n 2 and a clause (x{ V Xj). If the clause (xi V Xj) is not satisfied, then the 
gadgets for variables Xj and xj have a FALSE price assignment, and the prices for 
the edges in the gadgets for Xi and Xj are 2, 1, 1, 2, and 2, 1, 1, 2 respectively. Then, 
it is easy to see that the price of the bundle of the clause interval in this case is 
mn 2 + 6(i + j — 2) + 4, exceeding the budget of the clause interval. In the other three 
cases, the price of the bundle is at most mn 2 + 6(i + j — 2) + 3, and the profit from 
the clause interval is at least mn 2 + 6(i + j — 2) + 2 (In the case when both Xi and Xj 
are TRUE, the profit is mn 2 + 6(i + j — 2) + 2, in the two other satisfying assignments 
the profit is mn 2 + 6(i + j — 2) + 3). The proofs for the other types of clauses (xj V Xj), 
(xi Vxj), and (x7 Vxj) are similar. □ □ 

Proof of Theorem Suppose the instance of MAX-2-SAT has k satisfied clauses. 
We set the prices for the edges corresponding to the two basic gadgets corresponding 
to the variable Xi to TRUE if Xi = 1 and FALSE otherwise. We set the price of edge 
h to n 2 . This gives a total profit of 

n 

S = T • 36n + T • ^(mn 2 + 6(2i - 2) +6) +T • mn 2 + kmn 2 +0{kn) 

i=l 

The first term of the sum comes from the basic gadgets of each variable set to TRUE 
or FALSE, the second term comes from the consistency gadgets, the third term comes 
from the H intervals, and the last two terms, from the satisfied clause gadgets. 

To show the reverse direction, consider a price assignment that achieves a profit of 
at least S. We claim that in an optimal price assignment, the gadgets corresponding 
to the variables are all consistent, and the edge h has a price of mn 2 . Note first that 
the maximum profit we can gain from all the clauses is 0(m 2 n 2 ). Now, if we have 
larger than, say T = m 3 n 2 copies of each variable gadget, it follows from Lemma 
[5] that we only lose by making either the variable gadgets inconsistent, or if the H 
intervals and the consistency gadgets do not spend their entire budget. Hence, in the 
optimal solution, the variables are consistent, and h has a price of mn 2 . This then 
leaves only the clause intervals. Note that our profit maximizing pricing will try to 



18 



maximize the number of clause intervals satisfied, since the clause intervals differ by 
at most O(n) in their budgets, but their individual budgets themselves are at least 
mn 2 . By the obvious assignment of truth values to the variables from the variable 
price assignment, we get an assignment that satisfies k clauses. □ 
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Appendix B: The gadgets used in the NP-hard construc- 
tion 
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Figure 6: The clause gadget for clause (xj V Xj) 
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Figure 7: The clause gadget for clause (x{ V xj) 
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Figure 8: The clause gadget for clause (xj V xj) 
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Figure 9: The clause gadget for clause (xj V xj) 
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